Abstract

In this paper we study the possible microscopic origin of heavy-tailed
probability density distributions for the price variation of financial
instruments. We extend the standard log-normal process to include another
random component, as in the so-called stochastic volatility models. We study
these models under an assumption, akin to the Born-Oppenheimer approximation,
in which the volatility has already relaxed to its equilibrium distribution
and acts as a background to the evolution of the price process. In this
approximation, we show that all models of stochastic volatility should exhibit
a scaling relation in the time lag of zero-drift modified log-returns. We
verify that the Dow Jones Industrial Average index indeed follows this
scaling. We then focus on two popular stochastic volatility models, the Heston
and Hull-White models. In particular, we show that in the Hull-White model the
resulting probability distribution of log-returns in this approximation
corresponds to the Tsallis (t-Student) distribution. The Tsallis parameters
are given in terms of the microscopic stochastic volatility model. Finally, we
show that the log-returns of 30 years of Dow Jones index data are well fitted
by a Tsallis distribution, and we obtain the relevant parameters.



1 Introduction 

Like any mature field, Finance has adopted a simple model, developed over the
years, that attempts to describe the behaviour of random time fluctuations in
the prices of commodities or stocks observed in the markets [1]. This model,
which one could call the Standard Model of Finance (SMF), has many
shortcomings, but it has established a common language and a specific
framework that immediately allows for generalizations. It predicts, for
instance, prices for contracts on stocks, usually called derivative contracts
[2]. The SMF assumes that the fluctuations of the stock prices follow a
log-normal probability distribution function. It was first suggested by
Osborne in 1959 [3] and independently by Samuelson [3], improving on an
earlier model by guaranteeing non-negative stock prices.

In recent years, the large amount of financial data available has prompted
many empirical investigations of the probability distributions of returns,
defined as the logarithm of the ratio of stock prices separated by a given
time lag. The simple log-normal assumption of the SMF would predict a Gaussian
distribution for the returns, with variance growing linearly with the time
lag. What is actually found is that the probability distribution for
high-frequency data usually deviates from normality, presenting heavy tails.
Fits have been made using the truncated Lévy distribution [3] and the
so-called Tsallis (or t-Student) distribution [6].

An important question that we want to address is the nature of the
microscopic stochastic process that may lead to these non-Gaussian
distributions. For example, Borland proposed a feedback process where the
diffusion coefficient is related to the macroscopic probability distribution,
resulting in a non-linear Fokker-Planck equation whose solution is a
time-dependent Tsallis distribution [7].

In this paper we study an extension of the SMF in which the diffusion
coefficient of the stock price is itself governed by another independent
stochastic process. In a very general way, one can divide these models into
continuous-time stochastic volatility models and discrete-time Generalized
Autoregressive Conditional Heteroskedasticity (GARCH) models [8]. Both classes
can in principle describe some stylized facts such as volatility clustering
[9]. The second class of models, used in standard econometric analysis, is
based on time series data where previous values of the volatility and stock
prices are used to calculate subsequent values.

In the following we will assume that volatility is not directly observed, and
hence we will work in the context of the first class of models, where
volatility can be interpreted as a hidden Markov process [10]. We show that in
these stochastic volatility models there is a useful assumption that can be
made, which results in a probability distribution of returns with heavy tails
and a simple scaling time dependence. In particular, we find that the Tsallis
distribution follows from the Hull-White stochastic volatility model in this
case. We test this assumption with daily data from the Dow Jones Industrial
Average index and find better agreement than with the SMF.

2 Coupled stochastic processes for log-return and 
volatility 

The price of a stock, $s(t)$, is a stochastic variable on top of a
deterministic exponential growth (with inflation rate $\mu > 0$). It is
therefore customary to regard the modified log-return,

$$x = \ln\frac{s}{s_0} - \mu t, \tag{1}$$

as the indicator variable. It follows a Langevin equation of the type

$$dx = -a(v)\,dt + \sqrt{v}\,dW_1, \tag{2}$$

where $dW_1$ is the increment of a Wiener process with zero mean and variance
$dt$, and $\sigma = \sqrt{v}$ is called the volatility. The function $a(v)$
reflects the prescription used when deriving the x-process with additive noise
from the s-process with multiplicative noise. In the Ito scheme it is
$a(v) = v/2$. There exists, however, one particular Stratonovich scheme where
it vanishes, $a(v) = 0$. Another possibility is to modify equation (1) by
adding a term $\frac{1}{2}\int dt\, v(t)$.
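As a short worked step, the Ito value of $a(v)$ quoted above can be recovered as follows. This is a sketch that assumes the underlying price obeys the multiplicative process $ds = \mu s\,dt + \sqrt{v}\,s\,dW_1$ (implied by the text but not written out there), with $\mu$ the deterministic growth rate of eq. (1):

```latex
\begin{align*}
d\ln s &= \frac{ds}{s} - \frac{1}{2}\frac{(ds)^2}{s^2}
        = \left(\mu - \frac{v}{2}\right)dt + \sqrt{v}\,dW_1, \\
dx &= d\ln s - \mu\,dt = -\frac{v}{2}\,dt + \sqrt{v}\,dW_1
   \quad\Longrightarrow\quad a(v) = \frac{v}{2}.
\end{align*}
```

The second-order term uses the Ito rule $(dW_1)^2 = dt$; it is this term that the schemes with $a(v) = 0$ compensate.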

Equation (2) defines the SMF. An extension of the SMF considers the
possibility that the variance $v = \sigma^2$ itself is governed by stochastic
effects, as suggested by phenomenological observations on financial markets.
The proposed models have a first-order deterministic part causing an
exponential approach to the mean volatility, $\theta$, and a noise term
possibly influenced by the volatility itself:

$$dv = -\gamma (v - \theta)\,dt + b(v)\,dW_2. \tag{3}$$

The second Wiener process, $dW_2$, may or may not be correlated with the first
one. We will consider two main models of stochastic volatility developed in
the finance literature, namely the Heston and the Hull-White models. The
Heston model uses $b(v) = \kappa\sqrt{v}$ [12] and the Hull-White model uses
$b(v) = \kappa v$ [15].

Our purpose is to investigate approximations which enable a simplified
treatment predicting the distribution of x (or s) as a function of time.
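To make the coupled processes of eqs. (2) and (3) concrete, the following is a minimal Euler-Maruyama sketch. It is not the authors' code: the function name `simulate_sv`, the step size, the parameter values, and the crude positivity floor on the variance are all illustrative assumptions, and the two Wiener processes are taken to be uncorrelated.

```python
import math
import random


def simulate_sv(b, gamma, theta, v0, n_steps, dt, a=lambda v: 0.5 * v, seed=0):
    """Euler-Maruyama sketch of eqs. (2)-(3) with independent Wiener processes.

    b : callable, noise amplitude b(v) of the variance process
    a : callable, drift correction a(v) of the log-return process
    Returns the lists of x and v values, including the initial point.
    """
    rng = random.Random(seed)
    sqrt_dt = math.sqrt(dt)
    x, v = 0.0, v0
    xs, vs = [x], [v]
    for _ in range(n_steps):
        dw1 = rng.gauss(0.0, sqrt_dt)
        dw2 = rng.gauss(0.0, sqrt_dt)
        x += -a(v) * dt + math.sqrt(v) * dw1
        v += -gamma * (v - theta) * dt + b(v) * dw2
        v = max(v, 1e-12)  # crude floor: the discretized variance can overshoot zero
        xs.append(x)
        vs.append(v)
    return xs, vs


kappa = 0.5  # illustrative value only, not fitted to data
heston = simulate_sv(lambda v: kappa * math.sqrt(v), gamma=1.0, theta=1.0,
                     v0=1.0, n_steps=2000, dt=0.01)
hull_white = simulate_sv(lambda v: kappa * v, gamma=1.0, theta=1.0,
                         v0=1.0, n_steps=2000, dt=0.01)
```

Passing `a=lambda v: 0.0` selects the compensated scheme with vanishing drift correction.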



3 Born-Oppenheimer approximation 

Assuming that one of the coupled dynamical stochastic variables ($v$) reaches
its stationary distribution and then acts as an instantaneous,
time-independent background for the dynamics of the other ($x$) is analogous
to the Born-Oppenheimer (BO) approximation applied successfully in solid state
and atomic physics. We will make this assumption in the remainder of the
paper, explore its consequences and test it against data. The v-process can be
solved by itself; the corresponding Fokker-Planck equation is given as

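$$\frac{\partial \Pi}{\partial t} - \frac{\partial}{\partial v}\left[\gamma (v - \theta)\,\Pi\right] - \frac{1}{2}\frac{\partial^2}{\partial v^2}\left[b^2(v)\,\Pi\right] = 0. \tag{4}$$

The stationary detailed balance solution satisfies

$$\frac{\partial}{\partial v}\left[b^2(v)\,\Pi\right] = -2\gamma (v - \theta)\,\Pi. \tag{5}$$

The solution, the balance distribution of $v$, is given by

$$\Pi(v) = \frac{\mathcal{N}}{b^2(v)} \exp\left(-2\gamma \int \frac{v - \theta}{b^2(v)}\,dv\right), \tag{6}$$

with $\mathcal{N}$ a normalization constant. According to the BO approximation
we average the solution of the diffusion process of eq. (2) at a given $v$,

$$P(x; v, t) = \frac{1}{\sqrt{2\pi v t}} \exp\left(-\frac{(x + a(v)\,t)^2}{2 v t}\right), \tag{7}$$

over this stationary balance probability of $v$. We obtain an approximation
for the time evolution of the log-return probability:

$$P_t(x) = \int_0^\infty dv\, \Pi(v)\, P(x; v, t). \tag{8}$$

Collecting all terms together we arrive at

$$P_t(x) = \frac{\mathcal{N}}{\sqrt{2\pi t}} \int_0^\infty \frac{dv}{b^2(v)\sqrt{v}} \exp\left(-\frac{(x + a(v)\,t)^2}{2 v t} - 2\gamma \int \frac{v - \theta}{b^2(v)}\,dv\right). \tag{9}$$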
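The BO average of eq. (8) can also be evaluated numerically for any stationary density. The sketch below uses a plain midpoint rule; the function names, the truncation `v_max`, the grid size, and the Gamma density with illustrative parameters (shape 2, mean 1) are assumptions made for the example and are not taken from the paper.

```python
import math


def gaussian_kernel(x, v, t, a):
    """Solution of eq. (2) at fixed variance v: Gaussian with mean -a(v) t and variance v t."""
    return math.exp(-(x + a(v) * t) ** 2 / (2.0 * v * t)) / math.sqrt(2.0 * math.pi * v * t)


def bo_density(x, t, stationary_pdf, a, v_max=20.0, n=4000):
    """Midpoint-rule estimate of P_t(x) = int_0^inf dv Pi(v) P(x; v, t), truncated at v_max."""
    h = v_max / n
    total = 0.0
    for k in range(n):
        v = (k + 0.5) * h  # midpoints avoid the endpoint v = 0
        total += stationary_pdf(v) * gaussian_kernel(x, v, t, a)
    return total * h


def gamma_pdf(v, alpha=2.0, theta=1.0):
    """Gamma density with mean theta and shape alpha (illustrative parameter values)."""
    rate = alpha / theta
    return rate ** alpha * v ** (alpha - 1.0) * math.exp(-rate * v) / math.gamma(alpha)


p0 = bo_density(0.0, 1.0, gamma_pdf, a=lambda v: 0.0)
```

With `a=lambda v: 0.0` the resulting density is symmetric in x, and integrating it over x should return a value close to one, which is a convenient sanity check on the truncation and grid.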



4 Scaling in the BO approximation 

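In the case of a compensated Ito term, i.e. $a(v) = 0$, the dependence on the
time lag $t$ in eq. (9) is via the combination $x^2/2t$ and a factor
$1/\sqrt{t}$ in the normalization:

$$P_t(x) = \frac{1}{\sqrt{t}}\, f\!\left(\frac{x}{\sqrt{t}}\right). \tag{10}$$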

The scaling hypothesis (10) can be checked by detrending and normalizing
the data. First one considers the log-return with a unit time lag as

$$\xi_i = \ln\frac{s_{i+1}}{s_i}. \tag{11}$$

These values usually show a trend with the index $i$ for a fixed $t$. We
subtract the best-fitting linear trend, obtained by the Gaussian least-squares
method. From

$$\sum_i (\xi_i - a - b\,i)^2 = \min \tag{12}$$

it follows by differentiation with respect to the fit parameters $a$ and $b$:

$$\sum_i (\xi_i - a - b\,i) = 0, \qquad \sum_i (\xi_i - a - b\,i)\,i = 0. \tag{13}$$

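From this we obtain

$$b = \frac{12}{N^2 - 1}\left(\overline{i\xi} - \frac{N+1}{2}\,\overline{\xi}\right), \qquad a = \overline{\xi} - b\,\frac{N+1}{2}, \tag{14}$$

with

$$\overline{\xi} = \frac{1}{N}\sum_i \xi_i, \qquad \overline{i\xi} = \frac{1}{N}\sum_i i\,\xi_i. \tag{15}$$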

Next we compose the subtracted data as

$$y_i = \xi_i - a - b\,i. \tag{16}$$

These have by construction zero mean ($\overline{y} = 0$) and zero index
expectation value ($\sum_i i\,y_i = 0$). Finally the variance is normalized to
one by considering

$$X_i = \frac{y_i}{\sqrt{\frac{1}{N}\sum_k y_k^2}}. \tag{17}$$

We call the $X_i$ the (linearly) detrended and normalized log-return data.
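The detrending and normalization steps of eqs. (11) to (17) can be sketched in a few lines. The function names `log_returns` and `detrend_and_normalize` are hypothetical, and the closed-form slope and intercept are simply the standard straight-line least-squares solution for indices 1 to N; this illustrates the procedure and is not the authors' program.

```python
import math


def log_returns(prices, lag=1):
    """Log-returns ln(s[i + lag] / s[i]) for a list of positive prices."""
    return [math.log(prices[i + lag] / prices[i]) for i in range(len(prices) - lag)]


def detrend_and_normalize(xi):
    """Subtract the least-squares line a + b*i (i = 1..N) and scale to unit variance."""
    n = len(xi)
    idx = range(1, n + 1)
    mean_xi = sum(xi) / n
    mean_ixi = sum(i * x for i, x in zip(idx, xi)) / n
    # Closed-form solution of the two normal equations for a straight-line fit.
    b = 12.0 * (mean_ixi - 0.5 * (n + 1) * mean_xi) / (n * n - 1)
    a = mean_xi - 0.5 * b * (n + 1)
    y = [x - a - b * i for i, x in zip(idx, xi)]
    width = math.sqrt(sum(v * v for v in y) / n)
    return [v / width for v in y], a, b, width
```

A typical call would be `X, a, b, width = detrend_and_normalize(log_returns(prices))`, where `prices` is a list of closing values; the returned `width` is the root-mean-square residual before normalization.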

The distribution of these data is obtained by binning the $X_i$ values: the
distance between the minimal and maximal $X_i$ is divided into $B$ intervals,
so that $X_k = x_{\min} + k\,(x_{\max} - x_{\min})/B$ is the beginning of the
$k$-th interval. Whenever a given $X_i$ is between $X_k$ and $X_{k+1}$, the
counter $P_k$ is increased by one. The normalization $\sum_k P_k = N$ is
checked and $\sum_k k P_k$ is calculated. The normalized distribution is
reconstructed as $P_k/N = P(X_{k+1/2})$. We have tested the binning program
with synthetic data constructed from a normal Wiener process.
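The binning step described above can be sketched as follows, including a check on synthetic Gaussian data in the spirit of the test mentioned in the text. The function name `bin_counts` is hypothetical, and dividing the counts by the bin width to obtain a density that integrates to one is an added convention, not something stated in the paper.

```python
import random


def bin_counts(values, n_bins):
    """Count values in n_bins equal-width intervals between their minimum and maximum."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    if width == 0.0:
        raise ValueError("all values are identical, cannot bin")
    counts = [0] * n_bins
    for x in values:
        k = min(int((x - lo) / width), n_bins - 1)  # the maximum goes into the last bin
        counts[k] += 1
    centers = [lo + (k + 0.5) * width for k in range(n_bins)]
    return centers, counts, width


rng = random.Random(0)
sample = [rng.gauss(0.0, 1.0) for _ in range(10000)]  # synthetic data, not market data
centers, counts, width = bin_counts(sample, 100)
density = [c / (len(sample) * width) for c in counts]
```

The counts sum to the number of data points, which reproduces the normalization check $\sum_k P_k = N$.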

In order to check the scaling hypothesis we use the daily closing values of
the Dow Jones Industrial Average (DJIA) index from January 1976 to December
2006, with N = 5930 data points, and we adopt B = 100 bins. In Figure 1 we
show the index time series and the detrended and normalized returns.

Figure 2 presents the detrended and normalized log-returns at different time
lags from 1 day to 250 days. The closeness of these data indicates that the
scaling (detrending) is successful.

The universal scaling for the $a(v) = 0$ case is (cf. eq. (10))

$$\sqrt{t}\,P_t(x) = f\!\left(\frac{x}{\sqrt{t}}\right) \tag{18}$$

for any time lag $t$. This is testable on the detrended data with different
time lags.

In Figure 3 the log returns $\xi_i(t) = \ln(s_{i+t}/s_i)$ for a given time lag
$t$ show an average trend. The $a_t$ and $b_t$ parameters are obtained for
several different time lags (from 1 to 500 days), as well as the width of the
detrended data before normalization. The average of $\xi_i(t)$ (top panel)
follows the straight line
$\overline{\xi}(t) = a_t + b_t (N + 1 - t)/2 = \mu t$ with
$\mu \approx 4.35 \times 10^{-4}$, meaning yearly earnings of about 8.5% (an
average year had 5930/30 ≈ 197 working days of the stock exchange). The bottom
panel shows the width of the detrended data, which is proportional to the
square root of the time lag, on a double logarithmic scale. The fulfillment of
this proportionality indicates that the scaling
$P_t(x) = f(x/\sqrt{t})/\sqrt{t}$ is realized by the data, since
$\int P_t(x)\,dx = 1$. This supports the BO approximation with $a(v) = 0$.

[Figure 1, panel title: Dow Jones 1976-2006, s = 858.71, a = 3.527E-5, b = 1.326E-7]

Figure 1: Time data of the Dow Jones daily closing average $s_t$, showing the
trend to be subtracted in our procedure (top panel), and the detrended and
normalized log return $x_t$ (bottom panel) for 30 years of data.

Figure 2: The distribution of detrended and normalized log-returns $X_i(t)$
with different time lags $t$ for Dow Jones data 1996-2006. The near
coincidence of points indicates scaling and $a(v) \approx 0$.
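The square-root growth of the detrended width with the time lag can be checked on any price series. The sketch below does so for a synthetic Gaussian random walk, for which the exponent should come out close to 0.5; the function names, the lags, and the walk parameters are illustrative assumptions, and no market data is used.

```python
import math
import random


def detrended_width(xi):
    """Root-mean-square residual of xi after removing the least-squares line in the index."""
    n = len(xi)
    mean_xi = sum(xi) / n
    mean_ixi = sum((i + 1) * x for i, x in enumerate(xi)) / n
    b = 12.0 * (mean_ixi - 0.5 * (n + 1) * mean_xi) / (n * n - 1)
    a = mean_xi - 0.5 * b * (n + 1)
    return math.sqrt(sum((x - a - b * (i + 1)) ** 2 for i, x in enumerate(xi)) / n)


def width_exponent(log_prices, lags):
    """Least-squares slope of ln(width) against ln(lag)."""
    pts = []
    for t in lags:
        xi = [log_prices[i + t] - log_prices[i] for i in range(len(log_prices) - t)]
        pts.append((math.log(t), math.log(detrended_width(xi))))
    mx = sum(p[0] for p in pts) / len(pts)
    my = sum(p[1] for p in pts) / len(pts)
    return sum((p[0] - mx) * (p[1] - my) for p in pts) / sum((p[0] - mx) ** 2 for p in pts)


rng = random.Random(1)
log_s = [0.0]
for _ in range(6000):
    log_s.append(log_s[-1] + rng.gauss(4e-4, 0.01))  # synthetic walk, not market data
slope = width_exponent(log_s, [1, 2, 5, 10, 20, 50])
```

Applied to the logarithm of real closing prices, a slope near 0.5 would correspond to the straight line seen on the double logarithmic plot of the width.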



5 Heston model in BO 

The probability of returns in the Heston model has been extensively studied
[14]. In the Heston model, $b(v) = \kappa\sqrt{v}$ and the stationary
distribution is a Gamma distribution:

$$\Pi(v) = \frac{\alpha^{\alpha}}{\Gamma(\alpha)}\,\frac{v^{\alpha - 1}}{\theta^{\alpha}}\, e^{-\alpha v/\theta} \tag{19}$$

with $\alpha = 2\gamma\theta/\kappa^2$. In the Ito scheme $a(v) = v/2$ and the
above integral (9) can be determined analytically. The BO approximation
corresponds to what ref. [2] calls the short time behaviour. Our basic
assumption is that the volatility has already reached its stationary
distribution. We obtain




'''"■' = — (-77) \ — ' ' 2f <» ^i-^H <-''» 

with 

I67 

and K v {z) Bessel K-function. For large times / — * 1. For large positive x 
(extreme large wins on s) the Bessel function is dominated by an exponential, 

so 

P t (win \x) ^x n e-^r' (22) 



f=\l 1 + Z2j (21) 



and for large negative x (extreme losses) it is dominated by another exponential 

P t {loss) (x) ~x n e^ x . (23) 



Since / > 1 at all times it means that in this model losses tend to show a fatter 
tail (slower decrease) in x than wins. For short times, t < I67/K 2 this difference 
reduces. Note that e~? ±x = s - ^ is a power-law behavior in s, denoting by 
/± = (/±l)/2. 
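For reference, the integral behind eq. (20) can be sketched as follows. This is a reconstruction from eqs. (9) and (19) with $a(v) = v/2$, not a transcription of the authors' derivation; it relies on the standard identity $\int_0^\infty v^{\nu - 1} e^{-\beta/v - \lambda v}\,dv = 2\,(\beta/\lambda)^{\nu/2} K_\nu(2\sqrt{\beta\lambda})$, where $\beta$ and $\lambda$ are local symbols used only in this identity.

```latex
\begin{align*}
\frac{(x + vt/2)^2}{2vt} &= \frac{x^2}{2t}\,\frac{1}{v} + \frac{x}{2} + \frac{t}{8}\,v, \\
P_t(x) &= \frac{e^{-x/2}}{\sqrt{2\pi t}}\,
          \frac{(\alpha/\theta)^{\alpha}}{\Gamma(\alpha)}
          \int_0^\infty dv\; v^{\alpha - 3/2}
          \exp\!\left[-\frac{x^2}{2t}\,\frac{1}{v}
                      - \left(\frac{\alpha}{\theta} + \frac{t}{8}\right) v\right], \\
\frac{\alpha}{\theta} + \frac{t}{8}
   &= \frac{t}{8}\left(1 + \frac{16\gamma}{\kappa^2 t}\right) = \frac{t}{8}\,f^2,
\qquad
2\sqrt{\frac{x^2}{2t}\cdot\frac{t f^2}{8}} = \frac{f\,|x|}{2}.
\end{align*}
```

The last line uses $\alpha/\theta = 2\gamma/\kappa^2$ and shows where the argument $f|x|/2$ of the Bessel function and the definition of $f$ come from; the large-argument form $K_\nu(z) \sim \sqrt{\pi/2z}\,e^{-z}$ then gives the exponential tails.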

In the case of $a(v) = 0$ this model is also subject to the scaling
$P_t(x) = p(x/\sqrt{t})/\sqrt{t}$, as for normal diffusion. Now the
$e^{-x/2}$ factor is not present and $f = 4\sqrt{\gamma/(\kappa^2 t)}$ in
eq. (20). The distribution $P_t(x)$ becomes symmetric for wins and losses
(relative to the mean trend).

[Figure 3, panel title: Average log return DJ 1976-2006]

Figure 3: The trend in log-returns with time lag $t$ for Dow Jones data
1996-2006 (top) and the width of the detrended data, showing a scaling
compatible with the assumption $a(v) = 0$ in the stochastic log-return process
(bottom).

6 Hull-White model in BO

We now turn our attention to the probability distribution function resulting
from the Hull-White stochastic volatility model in the BO approximation. To
our knowledge this subject has not been discussed in the literature.

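In the Hull-White model $b(v) = \kappa v$ and the balanced volatility follows
a Gamma distribution in the reciprocal variable $y = 1/v$, the so-called
inverse-Gamma distribution:

$$\Pi(y) = \frac{(\beta\theta)^{\beta + 1}}{\Gamma(\beta + 1)}\, y^{\beta}\, e^{-\beta\theta y} \tag{24}$$

with $\beta = 2\gamma/\kappa^2$. The BO-approximated $x$ distribution in the
special scheme with $a(v) = 0$ becomes:

$$P_t(x) = \frac{1}{\sqrt{2\pi t}} \int_0^\infty dy\, \Pi(y)\, \sqrt{y}\; e^{-\frac{x^2}{2t}\,y}. \tag{25}$$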

Regarding the quantity $E = x^2/2t$ as an abstract "energy", the above formula
represents a Boltzmann-Gibbs energy distribution under the influence of a
fluctuating "temperature", which is identified with the variance. In
particular, a Gamma-distributed inverse temperature is known to lead to a cut
power-law (Tsallis, or t-Student) distribution in the energy variable:

$$P_t(x) = N(\beta, \beta\theta t) \left(1 + \frac{x^2}{2\beta\theta t}\right)^{-(\beta + 3/2)} \tag{26}$$

with

$$N(\beta, \beta\theta t) = \frac{\Gamma(\beta + 3/2)}{\sqrt{2\beta\theta t}\;\Gamma(\beta + 1)\,\Gamma(1/2)}. \tag{27}$$

The quotient of $\Gamma$ functions is the reciprocal of Euler's Beta function
$B(\beta + 1, 1/2)$. A Gaussian distribution with variance given by $\theta t$
is obtained in the $\beta \to \infty$ limit, which can be thought of as the
limit of a deterministic volatility ($\kappa \to 0$).
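The step from eq. (25) to eqs. (26)-(27) can be checked numerically. The sketch below compares the closed form with a midpoint-rule evaluation of the integral over $y = 1/v$; the function names, the truncation `y_max`, and the grid size are assumptions made for the example, while the parameter values are the fitted ones quoted in the paper for the daily data.

```python
import math


def tsallis_pdf(x, t, beta, theta):
    """Closed form of eqs. (26)-(27): cut power law with exponent beta + 3/2."""
    norm = math.gamma(beta + 1.5) / (
        math.gamma(beta + 1.0) * math.sqrt(2.0 * math.pi * beta * theta * t))
    return norm * (1.0 + x * x / (2.0 * beta * theta * t)) ** (-(beta + 1.5))


def bo_integral(x, t, beta, theta, y_max=60.0, n=60000):
    """Midpoint-rule estimate of eq. (25) with the Gamma density of eq. (24) in y = 1/v."""
    rate = beta * theta
    pref = rate ** (beta + 1.0) / math.gamma(beta + 1.0) / math.sqrt(2.0 * math.pi * t)
    h = y_max / n
    total = 0.0
    for k in range(n):
        y = (k + 0.5) * h
        total += y ** (beta + 0.5) * math.exp(-(rate + x * x / (2.0 * t)) * y)
    return pref * total * h


beta_fit, theta_fit = 0.861, 1.03  # values quoted in the text for daily DJIA returns
ratios = [bo_integral(x, 1.0, beta_fit, theta_fit) / tsallis_pdf(x, 1.0, beta_fit, theta_fit)
          for x in (0.0, 1.0, 3.0)]
```

Each entry of `ratios` should be very close to one if the closed form and the integral agree.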

This result is symmetric in win and loss percentages, i.e. for all positive
and negative $x$ values. As in the Heston model, asymmetries can be introduced
by choosing a non-zero value of $a(v)$. On the other hand, it is easy to see
that the large-$x$ behavior is now a power law in $x$, not in $s$.
Nevertheless, for large powers $\beta$ the cut power-law distribution becomes
quite close to the exponential for small and intermediate arguments, so the
difference may influence only the extremely large $x$ ($s$) values.

In Figure 4 we show a fit of the Hull-White model in the BO approximation
for daily returns of the DJIA index. We find $\beta = 0.861$ and
$\theta = 1.03$. One can see that the resulting Tsallis distribution provides
a much better fit compared to the usual Gaussian model for the log-returns. It
is important to notice that the parameters for the Tsallis distribution are
directly related to the Hull-White stochastic volatility model, providing a
microscopic origin for this non-Gaussian distribution.
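The fit form quoted in the caption of Figure 4, $\ln P_1(x) = a - c\,\ln(1 + b x^2/2)$, can be related to the model parameters by comparing it term by term with eq. (26). The sketch below assumes exactly that identification, $c = \beta + 3/2$ and $b = 1/(\beta\theta t)$; the function names are hypothetical and the paper does not spell out this mapping.

```python
def tsallis_from_fit(b, c, t=1.0):
    """Map fit parameters of ln P = a - c*ln(1 + b*x**2/2) to (beta, theta) via eq. (26)."""
    beta = c - 1.5
    if beta <= 0.0:
        raise ValueError("c must exceed 3/2 for a positive beta")
    theta = 1.0 / (b * beta * t)
    return beta, theta


def fit_from_tsallis(beta, theta, t=1.0):
    """Inverse map: (beta, theta) to the fit parameters (b, c)."""
    return 1.0 / (beta * theta * t), beta + 1.5


# Fit parameters implied by the values quoted in the text (beta = 0.861, theta = 1.03).
b_fit, c_fit = fit_from_tsallis(0.861, 1.03)
```

Under this identification the variance of eq. (26) equals $\theta t$, so a value of $\theta$ close to one is what one would expect for data normalized to unit variance at unit time lag.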
[Figure 4, panel title: DowJones 1996-2006, N = 2800, B = 40, tlag = 1, s = 1E-221]

Figure 4: The distribution of normalized and detrended log-returns $P_1(x)$
for Dow Jones data 1996-2006 (top) and 1976-2006 (bottom). The Hull-White
model fit in the BO approximation is given by
$\ln P_1(x) = a - c\,\ln(1 + b x^2/2)$. Using eqs. (26)-(27), this results in
the values $\beta = 0.861$ and $\theta = 1.03$ for the 1976-2006 data (bottom).

7 Discussion and Conclusions



It has been known for quite some time that the distribution of returns is not
Gaussian, as assumed by the SMF. What has been the subject of some debate
in the literature is the nature of the processes that cause this deviation
from normality.

Our results have shown that the observed non-Gaussian, fat-tailed
distribution of returns can be obtained microscopically from two coupled
independent stochastic processes, one for the returns and the other for the
volatility of the returns. In our analysis we assumed that the stochastic
volatility process quickly reaches a stationary distribution and acts as a
background for the return process. We then obtain the distribution of the
returns by integrating over the instantaneous volatility, which is distributed
according to the stationary distribution. This is analogous to the
Born-Oppenheimer approximation in physics, and we showed that it leads in this
case to satisfactory results. In particular, we demonstrated that all models
of stochastic volatility should exhibit a scaling relation in the time lag of
zero-drift modified log-returns, and we verified that the Dow Jones Industrial
Average index indeed follows this scaling for time lags from 1 day up to 500
days, beyond which the data become sparse.

We provided a microscopic explanation for a Tsallis distribution of
log-returns. The parameters of a microscopic Hull-White stochastic volatility
model uniquely determine the parameters of the macroscopic Tsallis
distribution. We estimated these parameters from a fit to 30 years of Dow
Jones index daily data, concluding that they result in much better agreement
than the SMF Gaussian distribution. Since the scaling is observed in the data,
this daily fit determines the distribution for all other time lags. The
transition to Gaussian distributions for large time lags occurs because the
fat tails are pushed away as the distribution broadens with time, and the
central region of the distribution is well described by a Gaussian
distribution. This can be seen by expanding eq. (26) for large time lags, a
well-known fact in the literature [15].

In a more general approach one should attempt to reconstruct $a(v)$ and
$b(v)$ directly from data, which is tantamount to reconstructing the best
stochastic volatility model compatible with the data. This attempt is
analogous to deciphering the Planck distribution, and hence the temperature,
of distant stars from their radiation spectra. In this respect the BO
approximation used in the Hull-White model should be viewed as yet another
particular model, and as such it should be tested. This was pursued in this
paper.